A Deconvolutional Strategy for Implementing Large Patch Sizes Supports Improved Image Classification
نویسندگان
چکیده
Sparse coding is a widely-used technique for learning an overcomplete basis set from unlabeled image data. We hypothesize that as the size of the image patch spanned by each basis vector increases, the resulting dictionary should encompass a broader range of spatial scales, including more features that better discriminate between object classes. Previous efforts to measure the effects of patch size on image classification performance were confounded by the difficulty of maintaining a given level of overcompleteness as the patch size is increased. Here, we employ a type of deconvolutional network in which overcompleteness is independent of patch size. Based on image classification results on the CIFAR10 database, we find that optimizing our deconvolutional network for sparse reconstruction leads to improved classification performance as a function of the number of training epochs. Different from previous reports, we find that enforcing a certain degree of sparsity improves classification performance. We also find that classification performance improves as both the number of learned features (dictionary size) and the size of the image patch spanned by each feature (patch size) are increased, ultimately the best published results for sparse autoencoders.
منابع مشابه
A Design Methodology for Efficient Implementation of Deconvolutional Neural Networks on an FPGA
In recent years deep learning algorithms have shown extremely high performance on machine learning tasks such as image classification and speech recognition. In support of such applications, various FPGA accelerator architectures have been proposed for convolutional neural networks (CNNs) that enable high performance for classification tasks at lower power than CPU and GPU processors. However, ...
متن کاملDeep Deconvolutional Networks for Scene Parsing
Scene parsing is an important and challenging problem in computer vision. It requires labeling each pixel in an image with the category it belongs to. Traditionally, it has been approached with hand-engineered features from color information in images. Recently convolutional neural networks (CNNs), which automatically learn hierarchies of features, have achieved record performance on the task. ...
متن کاملDevelopment and Evaluation of a Real Time Site-Specific Inter-Row Weed Management System
ABSTRACT- A real-time, site-specific, machine-vision based, inter-row patch herbicide application system was developed and evaluated. The image resolution was 640 × 480 pixels covering a total area of 350 mm x 240 mm of a field composed of four quadrants of 350 mm x 60 mm each. The image frames were processed by LabView® and MatLab®. The developed algorithm, based on weed coverage ratio and seg...
متن کاملHighly Efficient Forward and Backward Propagation of Convolutional Neural Networks for Pixelwise Classification
We present highly efficient algorithms for performing forward and backward propagation of Convolutional Neural Network (CNN) for pixelwise classification on images. For pixelwise classification tasks, such as image segmentation and object detection, surrounding image patches are fed into CNN for predicting the classes of centered pixels via forward propagation and for updating CNN parameters vi...
متن کاملLearning Document Image Features With SqueezeNet Convolutional Neural Network
The classification of various document images is considered an important step towards building a modern digital library or office automation system. Convolutional Neural Network (CNN) classifiers trained with backpropagation are considered to be the current state of the art model for this task. However, there are two major drawbacks for these classifiers: the huge computational power demand for...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2015